Shape invariant pitch modification of speech using a harmonic model

Authors

  • Darragh O'Brien
  • Alex I. C. Monaghan
Abstract

We present a simple but effective approach to pitch modification of speech based on a harmonic model. Building on our time-scaling algorithm [1], pitch modification applies to a harmonically coded glottal wave estimate derived via a simple inverse filtering technique [3]. The modified glottal wave subsequently serves as input to an LPC vocal tract filter and the pitch-scaled speech is generated. Shape invariance is maintained in the glottal wave by exploiting the harmonic nature of the sine waves used to code each frame, thus avoiding the need for "pitch pulse onset time" estimation. Furthermore, given its smooth shape, it is not necessary to resample the glottal wave spectrum at the new harmonic frequencies. The original spectrum is merely compressed/expanded to produce the desired pitch change.
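The compress/expand step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single frame already coded as harmonic amplitudes and phases, and the function names (`synth_harmonic`, `pitch_scale_frame`, `all_pole_filter`), the scaling factor `beta`, and all numeric values are hypothetical choices made for illustration. Inverse filtering, LPC analysis, and frame-to-frame phase continuity are not covered.

```python
import numpy as np

def synth_harmonic(amps, phases, f0, fs, n):
    """Synthesize n samples of sum_k A_k * cos(2*pi*k*f0*t + phi_k)."""
    t = np.arange(n) / fs
    k = np.arange(1, len(amps) + 1)[:, None]          # harmonic numbers 1..K
    return np.sum(amps[:, None] * np.cos(2 * np.pi * k * f0 * t + phases[:, None]), axis=0)

def pitch_scale_frame(amps, phases, f0, beta, fs):
    """Move harmonic k from k*f0 to k*beta*f0, keeping its amplitude and phase.

    The spectrum is stretched or squeezed along the frequency axis rather
    than resampled. Harmonics that would land at or above fs/2 are dropped.
    """
    new_f0 = beta * f0
    k = np.arange(1, len(amps) + 1)
    keep = k * new_f0 < fs / 2
    return amps[keep], phases[keep], new_f0

def all_pole_filter(x, a):
    """Direct-form all-pole filter: y[n] = x[n] - sum_{i>=1} a[i] * y[n-i], with a[0] == 1."""
    y = np.zeros(len(x))
    for n in range(len(x)):
        acc = x[n]
        for i in range(1, min(len(a), n + 1)):
            acc -= a[i] * y[n - i]
        y[n] = acc
    return y

# Hypothetical frame: 10 harmonics of a 100 Hz excitation sampled at 8 kHz.
fs = 8000.0
amps = 1.0 / np.arange(1, 11)
phases = np.linspace(0.0, 1.0, 10)
original = synth_harmonic(amps, phases, 100.0, fs, 160)

# Raise the pitch by one octave (beta = 2).
amps2, phases2, f0_2 = pitch_scale_frame(amps, phases, 100.0, 2.0, fs)
modified = synth_harmonic(amps2, phases2, f0_2, fs, 80)

# Pass the modified excitation through an illustrative (made-up) all-pole filter.
lpc = np.array([1.0, -0.9])
speech = all_pole_filter(modified, lpc)
```

Because every component remains an exact multiple of the new fundamental and keeps its original phase, the modified excitation in this sketch is the original waveform traversed `beta` times faster: each pitch period has the same shape and only its duration changes. That is one way to read the abstract's claim that shape invariance follows from the harmonic relation without any onset-time estimate.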

Similar resources

Shape invariant time-scale modification of speech using a harmonic model

A new and simple approach to shape invariant time-scale modification of speech is presented. The method, based upon a harmonic coding of each speech frame, operates entirely within the original sinusoidal model [3] and makes no use of the "pitch-pulse onset times" used by conventional algorithms. Instead, phase coherence, and thus shape invariance, are ensured by exploiting the harmonic relation exis...

High-Quality Speech Modification Based on Pitch-Synchronous Harmonic and Non-harmonic Modeling of Speech

In this paper, we propose a high-quality speech modification method based on pitch-synchronous harmonic and non-harmonic modeling of speech. In the proposed method, the harmonic and non-harmonic parts of speech are modeled by the sum of sinusoids with frequencies corresponding to pitch multiples and with randomized frequencies, respectively. Then, harmonic and non-harmonic parts are synthesized ...

Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis

To preserve shape-invariance when pitch or time-scale modifying sinusoidally modelled voiced speech, the phases of the sinusoids used to model the glottal excitation are made to add coherently at estimated excitation points. Previous methods achieve this by estimating excitation phases at synthesis frame boundaries, disregarding the frequency modulation that may occur between the frame boundary...

An implementation and evaluation of two diphone-based synthesizers for Turkish

This paper presents two diphone based Turkish text-to-speech systems; the first system is realized inside the MBROLA project, a freely available multilingual speech synthesizer and the second system is based on shape invariant harmonic modeling. Both synthesizers use the same parametric representations of two diphone databases (male, female) obtained by processing speech data with a pitch async...

Robust HNR-Based Closed-Loop Pitch and Harmonic Parameters Estimation

An important problem in the speech coding framework is model parameter estimation. In most cases, parametric speech coding methods do not preserve the shape of the speech waveform. This fact implies straightforward parameter estimation, and the analysis-by-synthesis method is hardly used. A novel analysis-by-synthesis parameter estimation method for speech coders based on harmonic models is presented. We introduce...

Publication date: 1999